Reading order detection on handwritten documents
نویسندگان
چکیده
Abstract Recent advances in Handwritten Text Recognition and Document Layout Analysis have made it possible to convert digital images of manuscripts into electronic text. However, providing this text with the correct structure context is still an open problem that needs be solved actually enable extracting relevant information conveyed by The most important needed for a set elements their reading order. Most studies on order are rule-based approaches focus printed documents. Much less attention has been paid so far handwritten documents, where becomes particularly important—and challenging. In work, we propose new approach automatically determine regions lines task approached as sorting order-relation operator learned from examples. We experimentally demonstrate effectiveness our method three different datasets at hierarchical levels.
منابع مشابه
Annoflow - Handwritten Annotation and Proof- reading on Dynamic Digital Documents
Phil Crosby Michael Quinn François Guimbretière Department of Computer Science Human-Computer Interaction Lab University of Maryland, College Park, MD, 20742 [email protected] [email protected] [email protected] ABSTRACT Proof-reading digital documents is a difficult task, because the ink annotations made on documents do not maintain their relevance as the document changes. In addition, applyi...
متن کاملRepudiation Detection in Handwritten Documents
Forensic document verification presents a different and interesting set of challenges as opposed to traditional writer identification and verification tasks using natural handwriting. The handwritten data presented to a forensic examiner is often deliberately altered, in addition to being limited in quantity. Specifically, the alterations can be either forged, where one imitates another person’...
متن کاملText line detection in handwritten documents
Article history: Received 13 April 2007 Received in revised form 26 March 2008
متن کاملConnected Component Based Word Spotting on Persian Handwritten image documents
Word spotting is to make searchable unindexed image documents by locating word/words in a doc-ument image, given a query word. This problem is challenging, mainly due to the large numberof word classes with very small inter-class and substantial intra-class distances. In this paper, asegmentation-based word spotting method is presented for multi-writer Persian handwritten doc-...
متن کاملOn Segmentation Methods for Handwritten Arabic Documents
In the literature, two methods for the extraction zones of the document are more used. The first method is based on the Mathematical Morphology (MM). The second is based on Hough Transform (HT). The main contribution of this paper is the application of these methods to extract the handwritten components of the complex document. The second contribution is the combination between the HT and the M...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Neural Computing and Applications
سال: 2022
ISSN: ['0941-0643', '1433-3058']
DOI: https://doi.org/10.1007/s00521-022-06948-5